Characterizing Shared-Memory Applications: A Case Study of the NAS Parallel Benchmarks
نویسنده
چکیده
The objective of this report is to present our characterization of a shared-memory implementation of the NAS Parallel Benchmarks (NPB). This characterization is needed to support the design decisions of future shared-memory multiprocessors. This report presents two sets of characterization data; the rst set is the application characteristics that do not change from one hardware connguration to another, and the second set is the traac characteristics of the application when run on a possible future hardware connguration. The data presented in this report include characterization of NPB communication, sharing, and cache behavior.
منابع مشابه
Mptrace: Characterizing Physical Memory Usage for Chip Multiprocessors
The performance of high performance computing applications depends highly on how they are implemented. However, their runtime behavior is tightly coupled with the resources they are allocated such as in which cores the application threads run or in which memory devices their memory space is placed. Thus, depending on their characteristics, applications may exhibit more affinity to specific type...
متن کاملParallelization of NAS Benchmarks for Shared Memory Multiprocessore
This paper presents our experiences of parallelizing the sequential implementation of NAS benchmarks using compiler directives on SGI Origin2000 distributed shared memory (DSM) system. Porting existing applications to new high performance parallel and distributed computing platforms is a challenging task. Ideally, a user develops a sequential version of the application, leaving the task of port...
متن کاملParallelization of NAS Benchmarks for Shared Memory Multiprocessors
This paper presents our experiences of parallelizing the sequential implementation of NAS benchmarks using compiler directives on SGI Origin2000 distributed shared memory (DSM) system. Porting existing applications to new high performance parallel and distributed computing platforms is a challenging task. Ideally, a user develops a sequential version of the application, leaving the task of port...
متن کاملCharacterizing Application Performance Sensitivity to Resource Contention in Multicore Architectures
Contention for shared resources in the memory hierarchy can have a profound effect on the performance of applications running on high-end computers based on commodity multicore microprocessors. In this paper we describe a methodology, based on differential performance analysis, to quantify the effect of this contention on parallel applications. In particular, we characterize the contention for ...
متن کاملExtending an Application-Level Checkpointing Tool to Provide Fault Tolerance Support to OpenMP Applications
Despite the increasing popularity of shared-memory systems, there is a lack of tools for providing fault tolerance support to shared-memory applications. CPPC (ComPiler for Portable Checkpointing) is an application-level checkpointing tool focused on the insertion of fault tolerance into long-running MPI applications. This paper presents an extension to CPPC to allow the checkpointing of OpenMP...
متن کامل